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Abstract 

Non-local correlations are not only a fascinating feature of quantum theory, but an interesting 
resource for information processing, for instance in communication-complexity theory or cryp- 
tography. An important question in this context is whether the resource can be distilled: Given 
a large amount of weak non-local correlations, is there a method to obtain strong non- locality 
using local operations and shared randomness? We partly answer this question by no: sym- 
metric (or isotropic) CHSH-type non-locality which is not "super-strong," but achievable by 
measurements on certain quantum states, has at best very limited distillability by any non- 
interactive classical method. This strongly extends and generalizes what was previously known, 
namely that there are two limits that cannot be overstepped: The Bell and Tsirelson bounds. 
Our results imply that there must be an infinite number of such bounds. 

A noticeable feature of our proof of this — purely classical — statement is that it is "quantum 
mechanical" in the sense that (both novel and known) facts from quantum theory are used in a 
crucial way to obtain the claimed results. One of these results — of independent interest — is 
that certain mixed entangled states cannot be distilled at all without communication. Weaker 
statements, namely limited distillability, have been known for Werner states. 
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1 Motivation and Main Result 



1.1 Non-Locality 

What is non-locality? 

Linearity of quantum physics allows for a phenomenon called entanglement: The joint state of two 
possibly distant parts of a system cannot be described by the states of the parts. Even when the 
two parts are measured in a way that permits no signal transmission from one measurement event 
to the other — such two events are called space-like separated — , the outcomes can be correlated. 
Although such a correlation may be surprising from a physical point of view, it need not be in 
terms of information: The distant particles could have shared classical information since they were 
generated together. Actually, this is what motivated Einstein, Podolsky, and Rosen [11] in 1935 
to postulate that quantum physics was incomplete and should be augmented by this information, 
called hidden variables, determining the outcomes of all potential measurements. The true surprise 
came about 30 years later, when John Bell [5j showed that quantum correlation cannot be explained 
by shared classical information in principle. 

How can this be? Measurement outcomes are classical random variables, are they not? 
It can be if one looks at correlations of two-partite systems with a given input/output behavior 
instead of just random variables. This makes sense physically, too: the input corresponds the the 
choice of the measurement basis. 

A bipartite system consists of two ends, usually called Alice and Bob, at each of which an input 
(x and y) is given and an output obtained (o and b). There are different degrees of correlation 
between the two ends: A system is called signaling if it allows for message transmission between 
Alice and Bob, and local if its behavior can be explained by shared classical information. Now, 
Bell's theorem, which has been celebrated as one of the "most profound discoveries in science" 
(Stapp), implies that certain entangled quantum states behave, when measured, non-locally (but, 
at the same time, non-signaling). Non-local behavior is, hence, a reality; and an experimentally 
verified one. 

What is known about non-locality, besides that it exists? 

Non-locality is being intensively studied, not only as a fascinating phenomenon, but also as a 
resource for information processing. Many central properties remain unknown; an example being 
whether there is a method to "distill" or "amplify" non-local correlations. In this note, we give a 
partial answer to the question: A specific but central type of non-locality has at most very limited 
distillability. 

What can non-locality be useful for? 

Two examples: First, it can replace communication in certain contexts — although it does not 
allow for communication. Second, it implies cryptographic confidentiality. The fact that the 
outputs of a non-local system are not completely pre-determined as classical information implies 
lack of knowledge by any possible adversary [3j! Note that the resulting cryptographic security 
follows directly from the observed joint correlations, is device-independent, and does not even rely 
on the fact that the adversary is limited by quantum physics; it only depends on the non-signaling 
postulate of relativity. 

But then, why is the problem of non-locality distillation important and interesting? 
Generally, the stronger the non-local correlations, the more useful they are. For instance, it has 
been shown [8] that the existence of non-locality which is super-quantum to some extent would 
collapse communication complexity: Every Boolean function could be computed in a distributed 
way with just a constant number of communicated bits. A second example is from cryptography: 
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The confidentiality that can be derived from non-locahty is stronger if the non-locality is. It is, 
by these reasons, a natural question whether it can be distilled (see also [1] where non-locality 
distillation is mentioned to be an interesting problem which should be investigated). 

For which types of non-locality do the results apply? Are there other types which are distillable? 
We investigate a special type of non-locality, namely isotropic approximations to non-local boxes. 
The latter are idealizations of systems violating the most prominent Bell inequality called CHSH 
(after Clauser, Horne, Shimony, and Holt) . Isotropic boxes are symmetric and, therefore, an 
important special case. They are, actually, the worst case when it comes to distillability: If they 
can be distilled, then all boxes of the same strength can. Our result states that the distillability of 
such systems is, at most, very limited (and suggests that it is, in fact, zero). This complements a 
result of [12], stating that certain (non-isotropic) systems can be distilled. This may indicate that 
CHSH violation is a good measure for non-locality only in the isotropic case. 



1.2 Our Contribution 

The most important and prominent type of non- locality is CHSH-type non-locality [lOj . There are 
two main reasons why we investigate this type of non-locality. On the one hand, it is the only 
possible non-locality of systems with binary inputs and outputs. On the other hand, for binary 
input/output systems the non-locality can be quantified by a single number (see equation dS])), 
whereas for systems with more inputs/outputs it is not clear how to quantify non- locality by a 
single number and therefore, to find a meaningful definition of non-locality distillation for such 
systems. The maximal possible CHSH-type non-locality has been modeled as a Popescu-Rohrlich 
(PR)-hox [16] or non-local (NL) box. Such a PR-box has input bits x,y G {0,1} and output bits 
a,b £ {0, 1} on Alice's and Bob's side, respectively, and computes the relation 

X A y = a (B b . 

In other words, Alice and Bob have to output different bits if and only if both of them obtain 
1 as their input. For all other input pairs they have to output the same bit. Furthermore, the 
output bits a and b must be unbiased (see Section [27T] for a definition), otherwise the bipartite 
system is signaling. It has been shown that a system which has this behavior with probability 
> 75%, averaged over all input pairs, is non-local; on the other hand, entangled quantum states 
can achieve roughly 85%. We call a system a ^-approximation to the PR-box if it satisfies the 
above described behavior, averaged over all input pairs of Alice and Bob, with probability q. The 
question we address in this paper is: Given a number of g-approximations to the PR-box, is it 
possible to obtain, without communication, a p-approximation for some p > q? In other words, 
we investigate Non-Locality Distillation Protocols (NDPs for short), which are two-party protocols 
between Alice and Bob. An NDP takes as input x, y G {0, 1} and outputs a,b £ {0, 1} such that a 
PR-box is approximated with probability p. Alice and Bob are allowed to perform arbitrary local 
operations, have shared randomness, can use (7- approximations of PR-boxes (where q < p), but are 
not allowed to communicate (see Section [2.31 for a detailed description of NDPs). We will prove 
the following impossibility result on the existence of NDPs. 



Main Result. There exists no non-locality distillation protocol which uses isotropic q-approxi- 
mations of PR-boxes, and has the same input/output behavior as a p- approximation of a PR-box 
such that p > q + g{q) and 

3 ,2W2 

-^<q<^^, (1) 
where g{q) is the distillability gap (see Theorem\^. 
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This statement is the first result about non-locality distillation besides Bell's and Tsirelson's 
limits, which no algorithm can cross (Section [3] covers the known bounds). Why the restriction 
in the statement? The reason lies in a particularity of its proof: it is "quantum-physical" ; this is 
remarkable since the statement is purely classically An example of such a statement is that the 
fully entangled fraction — i.e., proximity to a maximally entangled state — of certain bi-partite 
states cannot be increased non-interactively. Such a result has not been known before for any 
(non-trivial) state. For a different class of states, the Werner states, it has been shown that the 
possibility of non-interactive purification is limited; for our states, it is zero. 

In a nutshell, our proof works as follows: 

• The proof is by contradiction: We assume that an NDP exists which uses a finite number of 
isotropic g-approximations of PR-boxes — for some q that is quantum-physically achievable, 
i.e., satisfying ([1]) — and has the same input/output behavior as a p- approximation of a 
PR-box, for some p > q + g{q)- 

• This classical NDP can be simulated by local quantum operations, where we think of the 
g-boxes as coming from measuring a certain mixed quantum state il^- 

• Measurements can be postponed to the end, and the measurement operators on Alice's and 
Bob's sides can be simultaneously block-diagonalized, in blocks of size at most 2x2. It is, 
therefore, sufficient to consider measurements on qubits only. 

• Using the following two facts, we conclude that the CHSH-type non-locality resulting from 
the quantum NDP is not much stronger than the original. 

1. It is impossible to obtain, from many copies of 0,a, a state with a higher fully entangled 
fraction (i.e., proximity to a fully entangled state). 

2. By transforming this result from a statement about the impossibility of entanglement 
distillation to a statement about the impossibility of non-locality distillation we lose 
something (but not too much, namely the distillability gap g{q)) as there is no one- 
to-one correspondence between non-locality (CHSH violation) and entanglement (fully 
entangled fraction). 

Note that the statement we prove resembles a similar result on correlations, i.e., the impossibility 
of non-interactive correlation distillation [18j: Without communication, no highly correlated bit can 
be obtained from an arbitrary number of weakly correlated bits. 

2 Definitions and Preliminaries 

2.1 Systems and Correlations 

In order to explain the essence of non-locality, we introduce the notion of two-partite systems, de- 
fined by their joint input-output behavior P{a, b\x, y) with inputs x ^ X , y ^ y and outputs a & A, 
b £ B (see Figured]). A party receives its output immediately after giving its input, independently 
of whether the other has given its input already. Depending on the concrete form of P{a,b\x,y), 
the input /output behavior can be simulated by a physical process, namely by measurements on 

^By a "quantum-physical proof" , we do not mean that whether the statement is correct or not depends on whether 
quantum theory is an accurate description of nature or not. We mean that we make crucial use of mathematical 
theorems that deal with quantum-physical notions and that make sense, and have been proven, in the context of 
quantum theory. Therefore, the proof depends only on the soundness of quantum mechanics as a mathematical 
theory. 
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entangled quantum states, where x and y denote which measurements Alice and Bob select and a 
and h are the corresponding measurement results of Alice and Bob. 



X 



a ^ 



P{a,b\x,y) 



y 



- b 



Figure 1: A two-partite system. 

If we have binary inputs and outputs, i.e., X = y = A = B = {0,1}, the behavior of sueh as 
system can be represented by a four-valued function P{a,b,x,y) = P(a,b\x,y) : {0,1}^ [0,1], 
such that X^(ibg{oi} P{0'^^^^y) — 1 x,y & {0, 1}. By the notation P{a,b\x,y) we want to 

indicate that the outputs a, b depend on the inputs x, y. One can view P{a, b\x, y) as a conditional 
probability distribution or a stochastic matrix which assigns to each input/output pair a certain 
probability. 

The behavior on Alice's side alone is described by PA{a,x) = P^(a|x) : {0,1}^ — > [0,1] such 
that X^agjo 1} Pa{ol\x) = 1 for all x G {0, 1}, and similarly for Bob. If P{a, b\x,y) is non-signaUng 
the local behavior of Alice can be computed from the joint behavior by PA{a\x) = PA{a\x,y) = 
'^ljP{a,b\x,y). We call Alice's output bit unbiased if P4(a|a;) = 1/2, for a,x G {0, 1}. We classify 
systems by the correlation they introduce — or, equivalently, by the resource that is required to 
explain the joint behavior of its parts. 

Definition 1. Consider a two-partite system which is characterized by P{a,b\x,y). 

• The system P{a,b\x,y) is independent if there exist PA{a\x) and Pb(6|?/) such that 

P{a, b\x, y) = PA{a\x) ■ PB{b\y) Va, b,x,y e {0, 1} . 

• The system P{a,b\x,y) is local if there exist {P^(a|x)}"^j and {-PB(&|y)}f=i such that 

n 

P{a, b\x, y)=^wi- PX{a\x) ■ P^(6|y) Va, b,x,y E {0, 1} 
1=1 

for some weights Wi > 0, i = 1, ...,n. A system P{a,b\x,y) is non-local if it is not local. 

• The system P{a,b\x,y) is non-signaling if 

P{a,b\x,y) = ^ P{a,b\x,y') Va, x, y, y' G {0, 1} , 
6e{o,i} 6e{o,i} 

^ P{a,b\x,y) = Yl P(('^b\x',y) yb,x,x' ,y £ {0,1} . 

ae{0,l} ae{0,l} 

Consequently, a system P{a,b\x,y) is signaling if it is not non-signaling. 

In terms of classical resources, these categories correspond to no resources at all, shared infor- 
mation and message transmission (i.e., signaling), respectively. In this paper, we focus on systems 
in-between, i.e., which are non-local but at the same time non-signaling. 
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2.2 CHSH-Type Non-Locality 



The only non-locality a system can have if the inputs and outputs are binary, is so-called CHSH-type 
non-locality. A system is CHSH-non-local exactly if it allows for simulating the above-described 
PR-box with a probability superior to 75% [5]. On the other hand, appropriate measurements on 
quantum states achieve (2 + \/2)/4 ~ 85%, but no more than that pT]. It is still an open question 
why quantum physics is non-local, but not maximally so. 

In order to precisely quantify the CHSH-type non-locality of P{a,b\x,y), we first define four 
correlation functions: 

Ecc,y = P{0, 0\x, y) + P(l, y) - P(0, l|x, y) - P(l, 0|x, y) , x, y G {0, 1} . (2) 

Without loss of generality, one can assume that -Eq.O) Eq^i, Ei^ > 0, because there always exist 
local reversible transformations on P{a,h\x,y) which realize these conditions. The CHSH-type 
non-locality of P(a, y) can then be written as 

NL[P{a, h\x, y)] = Eo,o + ^o,i + ^1,0 - • (3) 

Algebraically speaking, the maximal value equation ([3]) can attain is 4, corresponding to Alice and 
Bob computing the relation xAy = a ©6, i.e., simulating perfectly a PR-box. It is not difficult to see 
that the behavior of P{a, b\x, y) with NL[P{a, h\x, y)] € [2, 2^/2] corresponds to a g-approximation 
of a PR-box with q = \ + l- NL[P{a, b\x, y)] G [3/4, (2 + V2)/4]. Therefore, as a 3/4-PR-box can 
be simulated by shared randomness only, the joint input/output behavior of Alice and Bob is local 
if NL[P(a,b\x,y)] < 2. For the rest of this paper we will use the shorter notation g-PR-box for a 
(7- approximation of a PR-box. Furthermore, we will use the notation NL[pab] to denote also the 
maximal attainable non-locality of a bipartite-quantum state pab by appropriate measurements. 

A g-PR-box can be simulated by local quantum processes and shared entanglement between 
Alice and Bob only if g < This is, however, not a sufficient condition: There exist g-PR-boxes 

with q < "^^^ which cannot be simulated quantum-physically [U]. 

In this paper, we will concentrate on PR-boxes with a special error behavior, so called isotropic 
PR-boxes. We call a g-PR-box isotropic if Alice and Bob compute x A y = a (B b with probability 
q and x A y = a © 6 with probability 1 — q. Furthermore, the probability that Alice and Bob 
output a = 0,6 = Oora = 1,6 = 1 is equal for all input pairs, and similarly for the outputs 
a = 0, 6 = 1 or a = 1,6 = 0. More formally, we call a (7-PR-box isotropic if it satisfies the 
following constraints on its correlation functions: -Eq.o = ^0,1 = ^1,0 = —Ei^i > 0. Additionally, 
we need Ebe{o,i} ^(«' ^1^' y) = PAia\x) = 1/2 for all a,x,y G {0,1} and Eae{o,i} ^1^' 2/) = 
Psibly) = 1/2 for all b,x,y £ {0,1}, i.e., the outputs are unbiased. An isotropic g-PR-box is 
quantum-physically realizable if and only if g < (2 + \/2)/4 (see Lemma H]). 

2.3 Non-Locality-Distillation Protocols 

Roughly speaking, a non-locality-distillation protocol is a — classical — bipartite protocol con- 
sisting of g-PR-boxes plus local operations, and having as a whole the behavior of a p-PR-box for 
some p > q. 

An NDP N'q takes as input the bits x and y on Alice's and Bob's side, respectively. The outputs 
of Mq are the bits a and 6, where Mq must have the same input/output behavior as a p-PR-box. 
During the protocol execution, Alice and Bob are allowed to use arbitrary local operations, shared 
randomness, and g-PR-boxes, but are not allowed to communicate. Because a box is non-signaling, 
it can be modeled as two local terminals on Alice's and Bob's side, respectively. A terminal is 
locally described by its input-output behavior. By using this language of terminals, a protocol 
execution consists of Alice and Bob applying local operations and local terminals. Moreover, one 
can easily handle protocols where inputs to boxes are delayed or boxes are intertwined. By the 
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latter, we mean the situation that on Ahce's side, the input to some Box 2 depends on the output 
of a Box 1, whereas on Bob's side, the dependence is opposite, i.e., the input to Box 1 depends 
on Box 2. Note that causahty is not violated since all boxes are non-signaling; boxes can answer 
outputs to inputs locally, even if no input has been given on the other end yet. 

3 Known Limits to Non-Locality Distillation 

Lemma [T] states that it is not possible to create non-locality from locality. 

Lemma 1. (Bell [5]) There exists no non-locality distillation protocol Mq with q < 3/4 and 
p > 3/4. 

Lemma [21 on the other hand, gives a bound on what is quantum-physically achievable (the proof 
can be found in the appendix). 

Lemma 2. (Tsirelson [17J) There exists no non-locality distillation protocol Mq , where the q- 
PR-hoxes must he quantum-physically realizable, with q < ^'^^ and p > ■ 

Note that the argument in the proof of Lemma [2] holds only for quantum-physically realizable 
approximations of PR-boxes. At first sight, one might think that every g-PR-box with q £ [j, ^-^-^] 
can be simulated quantum mechanically. But this is not the case (9j. Actually, there exist (3/4 
PR-boxes, for any < e < 0.125, which can be distilled up to p = 0.875 > Of course, 

these boxes cannot be simulated by quantum mechanics, as otherwise Tsirelson's bound would be 
crossed. Furthermore, there even exist non-isotropic PR-boxes which can be simulated by quantum 
mechanics and can be distilled [12j . 

Finally, by an easy argument one can show that perfect boxes cannot be obtained from imperfect 
ones. 

Lemma 3. Let e > 0. There exists no non-locality distillation protocol Mq with q < 1 — e and 
p = 1. 

4 A New Result on Non-Locality Distillation 

In this section we will prove our main result, Theorem[2l stating that it is impossible to distill more 
than a certain amount of non-locality (which will depend on q) from isotropic g-PR-boxes with 

f < 'Z < "^^4^ ■ Similarly to the proof of Lemma [21 we will use quantum mechanics to prove this 
classical statement. As a preparation, we establish some results about the correspondence between 
CHSH-type non-locality and the fully entangled fraction. 

4.1 CHSH Violation, Fully Entangled Fraction, and Weakly Entangled Mixed 
States 

The CHSH-type non-locality will be our measure of non-locality, whereas fidelity is a measure of 
entanglement, more precisely, of proximity between quantum states, defined by 



(4) 



where p and a are mixed states. Equation @ simplifies to F{p, {ipY) = p\il>), if cj = (V'l 
is a pure state. In particular, we are interested in the quantity 

F{p) = max ($+1 {Ua Ub)p{Ua ® Ub)^ |«>+> , (5) 
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which is called the fully entangled fraction [7j of the two-qubit state p and measures the closeness 
of p to a maximally entangled state. 

Theorem [1] below states that the fully entangled fraction of the following class of states cannot 
be increased noii-int6rcictiv6ly. Wg call tli6in wccikly cTitdTiglcd Tfiixcd states^ dcnotGcl r^a. 

Definition 2. The weakly entangled mixed state with a G [0, 1] is defined as 

1 — cv 1 — r\ 

J7„ = a|cI>+>(ci,+ | + __|00)(00| + ^-|ll)(ll| 



1 + a 
2 









1$+; 







We need some properties of weakly entangled mixed states. The proofs of Lemmas H] and [5] can 
be found in the appendix. First, we show which isotropic q-PR-boxes can be simulated using $7^. 

Lemma 4. An isotropic q-PR-hox can he simulated by 0,a, using appropriate measurements, if 

1 1 / ^ 

Note that the simulation of boxes by measurements on entangled states fits nicely into the 
formalism of boxes described by terminals (see Section l2.3p : A terminal corresponds to a local 
measurement performed on an entangled state. 

Lemma 5. Let 0,a be a weakly entangled mixed state and cr^^ be arbitrary two-qubit quantum states 
and J2ijPij — 1 ^^^^ Pij — 0' then 

ij ij 

where c (a) = a(2 ^2 - 2) - 2^/TTo? + 2. 



4.2 Impossibility of Entanglement as well as Non-Locality Distillation 

In the following, we consider Entanglement Distillation Protocols (EDP) without communication 
[1]. An EDP £ takes as input p®", where p is a (mixed) two-qubit state shared between Alice 
and Bob. The output of an EDP is one two-qubit (mixed) state a. The operation of an EDP 
is denoted by £{p®'^) = a. The performance of the EDP is measured by the fully entangled 
fraction-difference between the output state a and the input state p. Without loss of generality, 
we can assume that the EDP consists of Alice and Bob applying local unitary operations Ua 
and Ub on their input quantum state p^]^"* (ancilla qubits and p*^"). In other words, we have 

£{p®'^) = Tr^5 {{Ua ® Ub)p^ab^^{Ua Ub)^^ where Tr^g means that Alice and Bob trace out 

everything but a single qubit each (this resulting two-qubit state is denoted by a). Then, we 
can prove the following theorem which is mainly based on a result of Ambainis and Yang [Ij. It 
says that it is impossible to distill (without communication) out of many $7^ states a state which 
contains more entanglement than 0^. 

Theorem 1. For any probabilistic non-interactive entanglement-distillation protocol £ , the upper 
bound 

1 -I- (T 

F{£{nT)) < 

holds and is tight. 
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Proof. Assume we have n copies of the state l'^^). Then, we perform a measurement in the 
computational basis on r randomly chosen two-qubit states of |<I>"*")'^"'. Hence, for the measured 
states we get the result |00) and |11) with probability 1/2 each, and the remaining n — r two-qubit 
states stay unchanged. We denote the resulting set of possible states by {l*/"^)}?; = {(S)j=i }) 
where 

1 00) iiv[j]=0 
111) if^;[j] = l 
|<^+) if v[j] = * 

and V £ {0,1,*}". The degree of v, denoted by deg(iT) = r, is the number of j's in {l,...,n} for 
which v[j] 7^ *. It is easy to see that there are (")2^ states in the set of degree r which 

have all the same probability. We claim that the state written as 

nT=(a |^>+> ($+1 + (|00) (00| + 111) (11|)V" , (6) 



is just a mixture of the states G weighted with appropriate chosen probabilities, i.e., 

^r = E(^)«"~^(i-«r-'^?, (7) 

r=0 



where is chosen uniformly at random from the set {|</'^)}i/. This is indeed the case, because the 
quantum state w" can be written as an equal mixture of all possible states of degree r, i.e.. 




and be put into equation which yields 

= Ea"-^ ( E 1'^^^) ] ■ (9) 

r=0 ^ ^ \v:deg{v)=r j 

Comparing ([9]) and ([6]), one can easily see that d?]) must be correct. 

Equation d?]) implies that it is enough to analyze the behavior of the EDP E for the input state 
ijJ^ . Ambainis and Yang have shown in [Ij that 

f+(£:k))<i-^ , (10) 

where i^^(-) is defined by F+(cj) = (<1*^| cr |^^). It is not difficult to show that there exists another 
EDP, denoted by for any input p®"" , such that F{£{^p®'')) = F+ {£+ {p®"^)) . Therefore, equation 
(fToD is equivalent io F{£{u'^)) < I - ^. Together with d?]), we get 

which proves the statement. Taking the first of the n two-qubit states proves tightness of our bound 
(see Proposition [2] in the appendix). □ 

Note that it is crucial for this proof that the entangled input states are J^Q-states. If, for ex- 
ample, the input states were pure, there would exist an entanglement distillation protocol without 
communication which would actually increase the fully entangled fraction. The states 0^ are, 
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actually, the first class of two-qubit states for which it is provably impossible to distill entanglement 
without communication. It is an interesting open problem to find other states with this property. 

Before we prove our main result, we state a well-established result about the simultaneous 
block diagonalization of the projectors corresponding to two projective measurements with binary 
outputs [Is] . 

Lemma 6. Let Pq, Pi, Qq, Qi be four projectors acting on a Hilbert-space 7i such that Po + Pi = I 
and Qo + Qi = I- There exists an orthonormal basis in TC where the four projectors Pq, Pi,Qo, Qi 
are simultaneously block diagonal, in blocks of size 1x1 or 2 x 2. 

For completeness reasons we provide a proof of Lemma [6] in the appendix (taken from [15j). 

Theorem 2. There exists no non-locality distillation protocol Nq , where the q-PR-boxes must be 
isotropic, for any g € [|, ^-^j^] and p > q + g{q) with 

g{q) = \-q+^^^^m-l?-^- 

Figure [2] shows which p-PR-box could at most be distilled out of g-PR-boxes. Note that g(3/4) = 
and 5((2 + \/2)/4) =0. 

Proof. The proof is by contradiction. Assume there exists an NDP which uses isotropic g-PR- 
boxes and has the same input/output behavior as a p-PR-box such that p > q + g{q) for some 
Q £ [|, '^^/^ ]- As argued in Section [2.31 every NDP can be represented by local operations and 
terminals on Alice's and Bob's side, respectively. Let P{a, b\x, y) denote the input/output behavior 
of the NDP. 

The local operations can be represented by local unitary operations and the terminals can be 
simulated by appropriate measurements on f^Q, states (see Lemma H}. It is always possible to 
simulate the terminals, because the input boxes are isotropic g-PR-boxes with <? € [|, ^^^^ J. 

By introducing ancilla qubits, all intermediate measurements can be postponed to the end of 
the quantum circuits. Therefore, the quantum circuits can be represented by unitary operations 
C/J and U^, followed by measurements on the resulting states, where x and y are Alice's and Bob's 
input bits to the NDP. As the output of the NDP is one bit on Alice's and Bob's site, respectively, 
the measurements after the unitary operation on Alice's (Bob's) site can each be represented by 
two positive operator-valued measurements (POVMs) with binary output. We need two POVMs 
on each side because the measurements can depend on the input bits x and y, respectively. By 
introducing ancilla qubits, the POVMs can be replaced by projective measurements with binary 
outputs (Naimark's dilation theorem). We denote the resulting measurements by {Pj^'°'}a and 
{P^'^}b, respectively (a and b denote Alice's and Bob's output bit of the NDP). By merging the 
unitary operations U^, and the measurements {P^'"'}a, {PB^}b one gets the measurements 

{Pr}a = m ■ PT ■ {Ul)^}a , X G {0, 1} (12) 

{py/}, = {ui . py/ . (ui)% , y e {0, 1} (13) 

applied on the input qubits (i.e., the Qa and ancilla qubits). 

Lemma [6] tells us that there exists a basis such that the measurement operators of equation (jl2p 
can all be written in the same block diagonal form, with blocks of size at most 2x2. Let us denote 
these transformed measurements by {Pj^'"'}a, with x G {0, 1}, and the corresponding unitary basis 
change from the standard to the block diagonal basis by U'^°'^ . Note that U'^°'^ does not depend on 
Alice's input bit x to the NDP because it is the same basis transformation for all four projectors. 
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The same procedure can be applied on Bob's side, yielding block diagonal measurement operators 
{P^'''};,, with y G {0, 1}, and unitary operation C/^*"^. 

Let be a POVM where E\ is a projector onto the subspace of the simultaneous ith 

diagonal block of the four projectors P^'" with x,a G {0,1}. If TLa denotes the whole Hilbert 
space on Alice's side and T-L\ is the two dimensional subspace of TIa on which E\ projects, we can 
write the space TLa as a direct sum of orthogonal subspaces, i.e., TLa = ©j'^A- Furthermore, the 
projector E\ which acts on the whole space TLa can be decomposed as 

^^ = o©o©...eo©£;^©o©...eo , (i4) 

where E\ : TL\ TL\ is at the iih position in the direct sum and is actually just the identity on 
TL\. Similarly for Bob. 

If we measure the POVMs {-E^jj and on Alice's and Bob's side, respectively, after the 

unitary operations U'^"'^ and C/^'"^ but before the measurements {P^°'}a and {-Pg'^lfe, the outcome 
statistic of Alice and Bob does not change. In order to see this, we compute the corresponding 
input /output statistic of Alice and Bob. Let us denote the state of Alice and Bob right after the 
unitaries U'^°'^ and C/^*"^^ by pab- The mixed state after the measurements {-E^jj and is 
then given by 

~P^B = E Y.^^A ® 4) • PAB ■ {E\ ® 4) = Y.Pi^ ■ Pab > (15) 

i j ij 

with PAB on TLa®TLb andpij = Tt{(E\®E^^)pab) and where p^^ = 0©0©...©0©p^^©0©...©0 
are the normalized post-measurement states with on TL\ © W'^, i.e., p'j^g is just an embedding 
of into TLa 'S'TLb- Using the cyclicity of the trace and the fact that 

PT = Y.^A-PT-E\ Vx,aG{0,l}, P|'^ = ^4.P|'^4 Vy,feG{0,l}, (16) 

i j 

one can easily show that: 

P{a, b\x, y) ^ Tr f (P^?'" © ^) • pab) = Tr f (P^''^ © Pf ") • pab) Vx, y,a,be {0, 1} . (17) 



A ^ B ; HA^ J - '^'^ A B 

Therefore, we do not change the input /output statistic by these intermediate measurements. 

In the following we will try to map all information onto just two qubits so that we can apply 
Theorem [T] which says the the fully entangled fraction of any two qubit output state of any entan- 
glement distillation protocol does not have a larger fully entangled fraction than the input states, 
if the input states are ri^'s. We will do this mapping in a clever way such that the fully entangled 
fraction of this two qubit state is equal to the convex combination of the fully entangled fractions of 
the states in the convex decomposition (see equation (j26]) ). In order to achieve this we will proceed 
as follows. 

Let us define new projective measurement operators on the individual subspaces by 

Q'f''' := E\.PY-E\, (18) 
Q^^y'' := 4.Pf^4, (19) 

for all X, y, o, 6 G {0,1} and Q^""'" = 0©0©...©0©Q^^'"©0©...©0 with Q^^'" acting on H^. Then, 
by using equations ([15]) and p!7|) . the cyclicity of the trace and the idempotence of projectors, we 
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get 

Pia,b\x,y) = Tr((Pr 



= Y.P^r Ppj^i<^Mx.y) , (20) 

where P ij (a, b\x, y) denotes the input/output statistic resulting from measuring the state p'ln by 

Pab 

the measurement operators {Q^^'" ® Q-'^' }a,h- 

By using Lemma [5] and the fact that mixing input / output statistics cannot increase the non- 
locality we get 



NL[P{a,b\x,y)] < Y.PirNL[P^j^{a,b\x,y)] 

< Y.P^rNL[4^] 

< NL[Qa] + c{a) . (21) 

In order to be allowed to apply Lemma [5] in the last inequality of equation ()2ip we need to prove 
that 

F{n^)>Y,p^rF{p%B) ■ (22) 

ij 

This can be shown as follows. Let us introduce a completely positive map (CPM) M : TIa^'Hb 
Hi^n§, where and are single qubit Hilbert spaces, which we define by Ai{pAB) = 
T.ij{U\®Ui)pAB{U\®Ui)\ where the operators U\:Ha^ and W^-.Hb^ are defined 
as 

u\ = o©oe...©o©;7jieo©...eo , (23) 

Uj^ = 0©0©...©0©[/|j©0©...©0 , (24) 

where U\ : n\ ^ and : ^ are unitary operations on single qubit subspaces on 
which E^^ and E''^ project, respectively (the CPM Ai is actually a trace preserving operations as 
Yliji^A ® ^b)^ ' i^A ® ^b) ~ How we choose these unitary operations will be explained later. 
As pab is the state after the measurements and it has block diagonal structure and 

therefore, if we apply the CPM M to it, we get the following two qubit state 

Pab = M{pab) = Y.^U\0Ui)-pAB-{U\0Uj,y 

ij 

= T.P^riUA^uj,)■p^i^■iWA^ui,)^ . (25) 
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Computing the fully entangled fraction of pab yields 

F{pab) = max($+|(C/^0?7ij)pAB(C/A»?7ij)t|$+) 
Ua,Ub 

= Y^pi, ■ max ($+|(C/^ UbWX ui)f/ij,iU\ ui)\UA C/b)^|$+) 

= (26) 

ij 

where we have chosen the U^s and f/^'s in such a way that they maximize the fidelity for each 
and we therefore get the fully entangled fraction F{p^j^^) for each two qubit state p^^- Furthermore, 
as the CPM M can be replaced by a unitary operation by introducing ancilla qubits we know, due 
to Theorem [H that the fully entangled fraction of pab is smaller or equal than the fully entangled 
fraction of Q.a-, i.e., F{Q.a) > F{pab)- Together with equation (p6|) this argument finishes the proof 
of equation (p2]) . 

Hence, we have proved equation (j22p and therefore also equation (j2ip . This implies that the 
non-locality of the NDP, denoted by NL[P{a,b\x,y)], is smaller or equal than the non-locality 
of the input state plus some term c(a). By using the the fact that NL[Qa] = 2\/l + (see 
Proposition [1] in the appendix) and the relation (7=1/2 + 1/8 - A^L which is a rescaling of 
the non- locahty from the interval [2, 2^/2] to [3/4, (2 + \/2)/4], we can re-express equation (|2T]) by 
P < (1 + 9{q)- But this is in contradiction to our initial assumption, which demanded that the NDP 
has an input /output statistic which is more non-local than q + g{q)- □ 

Figure [2] shows the limited distillability of g-PR boxes. The largest gap can be found at q = 

\ + \\I^^T^ ^ ^-^^^ which results in a gap of around 5(0.775) « 0.0225. Note that the only 
possible consistent way of how non-locality could be distilled must have step function form. First, 
it is clear that out of {q + e)-PR-boxes (with e > 0) one can always distill as much non-locality as 
of g-PR-boxes. Second, assume that we can at most distill a (g + e)-PR-box out of g-PR-boxes. If 
we now could distill a {q + e + 7)-PR-box out of m {q + (5)-PR-boxes, with (5 < e, and could distill a 
(g + (5)-PR-box out of n (?-PR-boxes, we could simulate a ((7 + e + 7)-PR-box with n-m g-PR-boxes. 
But this is a contradiction to our assumption that at most a (g + e)-PR-box can be distilled out of 
(7-PR-boxes. 

Furthermore, we get an immediate corollary out of Theorem [2l namely: 

Corollary 1. There are infinitely many q G [3/4, (2 + \/2)4] for which it is impossible to distill a 
p-PR-hox out of isotropic q-PR-hoxes for any p > q. 

Proof. That there are infinitely many points where non-locality distillation is impossible follows 
from the fact that any non-locality distillation protocol has the above described step function 
feature. This is the case because the curve q + g{q) and the line q are getting closer and closer for 
larger q and finally meet at the point (2 + \/2) /4. □ 

5 Concluding Remarks 

The joint behavior of the two parts of a bipartite input-output system is non-local if it cannot be 
explained by shared (classical) information. The interest of non-locality comes from two facts: First, 
it is real, i.e., appears as the measurement-outcome behavior of certain entangled quantum states 
— even if the measurement events are space-like separated. This does not formally contradict 
relativity as long as the system is non- signaling, i.e., does not allow for message transmission. 
However, it contradicts the spirit of relativity, which is the reason why Einstein et al. [llj had 
asked for local explanations. Second, it is useful for information processing, e.g., for reducing the 
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Limited Distillability of g-PR Boxes 




Figure 2: The maximal amount of distillable non-locality is shown. For given isotropic g-PR-boxes 
at most a p-PR-box which has a p below the top curve q + g{q) could be distilled. The step 
function plotted between the upper limit curve and the dotted non-distillability line shows one 
possible behavior of how much non-locality could be distilled for given Q'-PR-boxes. 

communication complexity or for achieving unconditional confidentiality. Since non-locality is more 
useful if it is stronger, non-locality amplification is a natural goal. 

Only little is known on whether non-locality amplification was possible: Two bounds cannot 
be over-stepped, namely the Bell and Tsirelson bounds. On the other hand, perfect non-locality 
cannot be obtained from imperfect. Here, we show the first result stating that only limited distilla- 
bility could be possible for a large class of systems, namely isotropic quantum-physically realizable 
approximations to PR-boxes. Our results imply that there are infinitely many g G [3/4, (2-|-\/2)/4] 
where non-locality distillation is impossible at all. 

It is somewhat remarkable that our result, which talks only about classical systems and cir- 
cuits, has a "quantum-physical proof": It makes (central) use of facts from quantum theory and 
information science. An example is a novel result stating that the fully entangled fraction of a class 
of mixed entangled states cannot be increased non-interactively at all. This is the first result of 
this strength, and it is a challenging open problem to find other states with this property. 

Our result complements [12], where it is shown that certain non-isotropic non-local systems 
can be distilled. This may suggest that the amount of CHSH violation is a good measure for 
non- locality only for isotropic systems. We state as an open problem to find a general proof that 
isotropic non-locality can never be amplified. 

Acknowledgments. We thank Nicolas Gisin, Renato Renner and Jiirg Wullschleger for many 
helpful discussions, and an anonymous referee for pointing out to us the reference about simultane- 
ous block-diagonalization of two bipartite measurements which we could apply in order to correct 
an earlier version of this paper. 
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APPENDIX 



A Proof of Lemma [2] 

Lemma 2. (Tsirelson [17J) There exists no non-locality distillation protocol Mq , where the q- 
PR-boxes must be quantum-physically realizable, with q < ^"^^^ and p > "^^^ ■ 

Proof. The proof is by contradiction. Assume there exists an NDP which uses quantum-physicahy 
obtainable g-PR-boxes and has the same input /output behavior as a p-PR-box such that p > ■ 
By assumption, the behavior of the g-PR-boxes can be obtained by measuring some entangled 
states. It can be shown that every such circuit which consists of classical local operations, shared 
randomness, and measurement operators on entangled quantum states is equivalent to some Her- 
mitian measurement operators X^, X^, Yg, and operating on these entangled quantum states 
and ancilla qubits. The notation means that if Alice's input bit is i, she applies the observable 
on her part of the entangled and ancilla qubits; similarly for Bob. The observables X^, X^, Yg, 
and have binary output (but arbitrary finite input alphabet), which corresponds to the output 
of the NDP. Tsirelson has shown in [T7] that for any observables of finite dimension with binary 
output which are applied on any entangled state, the maximal CHSH non-locality that can be 
achieved is 2\/2, which corresponds to a ^^-PR-box. Therefore, the NDP we have constructed, 
and transformed into Hermitian measurement operators on entangled quantum states, must also 
obey this bound. □ 

B Proof of Lemma [4] 

Proposition 1. The jiOTi-locdlity of 



NL[nc,] = 2vr+li2 . 

Proof. Assume we have given a Bell diagonal mixed quantum state 

p = pi|$+)(<5+|+p2|$">($"|+P3|^+>(^+|+P4|^">(^"| . (B-1) 

Using the results from [14] one gets for the maximal CHSH violation of p the following value 



NL[p] = 2VA7+A2 (B-2) 

where Ai and A2 are the two larger values of the following three expressions 

(2 • {pi + Ps) - If , (2 • {p2 + P3) - If , (2 • (pi + P2) - If . (B-3) 

As for r^Q, we have pi = (1 + q) /2, P2 = (1 — a) /2 and ps = P4 = 0, the two larger values of equation 
()B-3P for Ai and A2 are 

Ai = a , 



A 



2 



Hence, the maximal CHSH violation for becomes 

Lemma 4. An isotropic q-PR-box can be simulated by Q^, using appropriate measurements, if 



1 1 / n 
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Proof. The state ^la has non-locality of NL[Qa] = 2Vl + when appropriate measurement op- 
erators are chosen by Alice and Bob (see Proposition [T|) . We will see that it suffices to consider 
measurements in the real plane to achieve this non-locality. Therefore, such measurements can be 
described by observables of the form 

M{v) = {v,a), (B-4) 

where v = (cos((/)), sin(0))-^ G is a real unit vector (the measurement direction) and a = 
{(Tx,o'z)'^ , where ax and az are Pauli matrices. Let us define the following observables for Alice: 

M{ao) = az , M{ai) = ax (B-5) 

and for Bob, 

-> en 1 ^ — Oi 1 

^^^o) = rr^ ''^ + rr^ ''^ ' ^(^i) = /— + r—^ ^^z ■ (B-6) 

vl + a vl + a vl + a^ yl + a 

As these four Hermitian measurement operators have the eigenvalues -|-1 and —1, each of them can 
be written as the difference of two projectors, namely M[ax) = — P^^ and M{hy) = P^^ — P^^ . 
Hence, the measurement statistic of the joint system of Alice and Bob becomes 

P(a,6|:E,y) =Tr((P^''^0Pf')-J7,) Vx, y, a, & G {0, 1} . (B-7) 

To compute the non-locality of P(a,6|x,y) we need to determine the correlation functions (see 
equation ([2])): 

E^^y = P(0, 0|x, y) + P(l, l|x, y) - P(0, l|x, y) - P(l, 0|x, y) , x, y e {0, 1} . (B-8) 
By using ()B-7p and the linearity of the trace one gets 

Ex^y = Tv({M{ax)®M{by))-Vlo}j . (B-9) 

By some easy calculations, one gets for the four correlation functions the following expressions 
1 1 

This gives for the non-locality of P(a, 6|x,y) (see equation dS])), which corresponds to measuring 
the state Via by the measurement operators M{ax) and M(by), for x,y £ {0, 1}, a value of 

NL[Pia, b\x, y)] = NL[n^] = Eo,o + ^o,i + ^i,o - = 2^1+^ . (B-11) 

Lemma [T] tells us that this is the maximal possible value we can get for the non- locality of r^Q,. 
Hence, the measurements we defined in equations ()B-5P and ()B-6P extract as much non-locality out 
of Oq, as possible. 

Using q = 1/2 + 1/8 • NL[^}a], which is just a rescaling of CHSH-type non-locality from the in- 
terval [2,2^/2] to [4, y ] , we get the upper bound given in the lemma. But the joint input/output 
behavior P(a,b\x,y) of Alice and Bob does not yet describe an isotropic PR-box. 

Two requirements have to be fulfilled by P{a, b\x, y) such that it corresponds to an isotropic PR- 
box. Namely, the local output by Alice and Bob must be unbiased, i.e., PA{a\x) = PB{b\y) = 1/2 
for all X, y,a,b £ {0, 1}, and the correlation functions must satisfy Eq^ = £^o,i = = —Ei^i (see 
Section [2.2p . Computing the local input/output behavior of Alice (and similarly for Bob) yields 

PA{a\x) = Tr (P^'^ • TvBin^)) = Tr (P^''^ • 1/2) = ^Tr(P_^''^) = \ Vx, a G {0, 1} (B-12) 
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where Tr^ means taking the partial trace over Bob's system and / is the identity matrix of dimension 
two. Therefore, the input /output behavior of Ahce (Bob) is unbiased. In order to get the correct 
form for the correlation functions, Alice and Bob apply togetheiH the following two operations, 
each with probability 1/2: 

1. Alice and Bob do nothing, or 

2. Alice flips her input bit x and Bob flips his output bit h if his input bit y is equal 1. 

It is not hard to show that Alice and Bob induce the following transformations on the four corre- 
lation functions by applying step 2. 

-£■0,0 Eifl , £"0,1 —Ei^i , Eifi Eofi , Ei^i —Eq^i . (B-13) 

The resulting correlation functions are then (note that the local input/output behavior has not 
been changed by this process) 

^0,0 — ^ , — ^ , -C/1,0 — ^ , ^1,1 — 2 ■ [0-14:) 

As £^0,0 + -£-0,1 + ^1,0 — Ei^i is still 2VT+"a^, we did not lose any non-locality by this process. 
Furthermore, we now have an isotropic PR-box, as desired. □ 



C Proof of Lemma [5] 

Proposition 2. The fully entangled fraction of Qa is 

Proof. In order to find F{0,a), we have to find unitary operations Ua and Ub which maximize 
(<1>+| {Ua UB)''^a{UA ® Ub) \^^)- Let us define the joint unitary operation by 

{Ua ® Ub) \<^+) = \^) = aoo |00) + aoi |01) + aw |10) + an |11) (C-1) 

where aoo, ooij ^lo and an are complex numbers. Computing (^|ilQ,|^) yields the following 
equation after some easy calculations 

(^-l 0„ 1^) = ^ • (kool^ + + a ■ {a*oo ■ an + a*^ ■ aoo)) ■ (C-2) 

What is the maximal value this equation can attain? As aoi and aio do not appear in the equation, 
they can be set to 0. Therefore it must hold that |aooP + lonp = 1 because of the unit length 
of quantum state vectors. As |aooP + = 1, it is easy to see that equation (\C-2\i attains its 

maximal value when aoo = oii = Hence, this equation becomes maximal when 1^*) = 1$"^) 
and therefore the fully entangled fraction of becomes 

F{n^) = (<I>+| 0„ \<^+) = , (C-3) 

which finishes the proof. □ 

Proposition 3. Let p be an arbitrary two qubit quantum state with ^ < F{p) < 1. Then the 
non-locality of p is upper bounded by 

NL[p] < 2^1 + (2F(p) - 1)2 . 
^hence, they need one bit of shared randomness for this procedure 
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Proof. We will actually prove a stronger result, namely that 0,^ is maximally non-local for given 
fully entangled fraction, i.e., F(Qa) ^ -^(p) =^ NLlTla] ^ ^L[p]. If we can prove this result, the 
proposition follows, because F{^}a) = and NL[i}a] = 2Vl + a^, due to Proposition [1] and [21 
respectively. 

An arbitrary two qubit quantum state can be written as 



1 / 

- il^I + f-a^I + I^s-a+^Ti. 



(Ti aj (C-4) 



where / stands for the identity operator, f,s£ and f ■ a = Ylf=i '''i'^i with {cjj}?^]^ being the 
Pauli matrices and Tij £ [—1, 1]. 

In the following we are mainly interested in the real 3x3- matrix T which is defined as 
Tij = Tr(p(c7j (8) (Jj)), because the fully entangled fraction and the non-locality of p depend only 
on the coefficients Tij as shown by the Horodecki family and Badzicig in [14J and [2\. As shown in 
[2] one can find local unitary operations Ua and Ub such that pd = {Ua <E) Ub)p{Ua 53 Ua)'^ has a 
diagonal matrix T, i.e., T = dza5(Tii, T22, Tss). 

The non-locality of pd, measured by the maximal violation of the CHSH inequality, is given by 

m 



NL[pd] = 2^Tl+Tl (C-5) 

where we assumed without loss of generality that Tn and T22 have the larger absolute value than 
T33. In order for pd to be non-local, we need Tf^ + T22 > 1, which implies \Tii \ + IT22I > 1. By 
using the following two implications from [2], 

|Tii| + |r22| + |r33| >i F{pd)>^, (C-6) 

Fipd)>l det(T)<0, (C-7) 

we can conclude that det(T) = Tn • T22 • T33 < if is non-local. Hence, because we are only 
interested in non-local states we must have det(T) < 0. Furthermore, the unitary operations Ua 
and Ub can be chosen in such a way that Tii,T22 < 0. Together with det(T) < we can conclude 

that rii,r22,r33 < o. 

As we assume that T is diagonal, the singular values of T are simply |Tii| , IT22I, and IT33I. The 
fully entangled fraction of pd is then given by [2] 

F{pd) = ^ (1 + iT^iil + 1^22! + IT33I) , (C-8) 

where we assumed that pd is non-local, i.e., det(T) < 0. 

It is easy to see that NL[p] = NL[pd\ because the local unitary operations do not change the non- 
locality of a given state. Additionally, it is also true that F{p) = F{pd) because the fully entangled 
fraction of p is not changed by local unitary operations. Hence, if we prove that -F(0„) > F{pd) 
implies NLlTla] ^ ^L[pd], the result itself is proved. Furthermore, using Propositions [Hand [2] and 
equations ()C-5P and (jC-Sp . it is enough to prove 

> ^ (1 + iTnl + |r22| + |r33|) 2VT+^ > 2^Tl,+Tl . (C-9) 

In order to be able to prove equation (IC-OP , we first need to derive restrictions on the coefficients 
of the matrix T, because not all matrices T with Tij £ [—1,1] correspond to a valid quantum state, 
i.e., a density matrix which is a positive operator with trace 1. An operator pd is positive if and 
only if 

{yp\pd\y^)>0 y\ip) . (C-10) 
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Therefore, we must also have 

{^+\ Pd\^+) > . (C-11) 
Simple calculations show that (|C-lip implies 

T33 < 1 + Tn + < , (C-12) 

where the second inequality is an implication of Tn +T22 < —1 (which follows from |Tii| + IT22I > 1 
and Tu,T22 < 0). 

Solving the left-hand side of equation ()C-9P for a, and using Tn, T22, T33 < 0, we get 

a> -^(i + rn + r22 + r33) . (C-i3) 

It is easy to see that 1 + Tn + r22 + r33 < if we demand to be non-local and, hence, a > 0. 
Hence, we can conclude that 

l + a2 > i + i(i + Tn + T22+T33)2 . (C-14) 

By putting equation ([C^ into ([C^ . we get 

l + a^ > 1 + (1 + Tn + r22)^ 

= 2 + 2Tn + 2T22 + 2TnT22 + Tf^ + T^^ 

= 2((i + rn)(i + r22)) + r2i + 

> Tl + TI2 . (C-15) 

This holds because of I + T11+ T22 T33 < in equation (foni) and I + T11+ T22 < in (1012]) 
and (|C-15p . The last inequality comes from the fact that Tn > — 1 and T22 > — 1- Hence, we have 
proved that 1 + > T^^ + T22 and therefore equation ()C-9P . □ 

Figure [3] shows the correspondence between the fully entangled fraction and non-locality of 
states. Note that pure states have exactly the same relation. 

Proposition 4. Let p be an arbitrary two qubit quantum state with | < F{p) < ^. Then the 
non-locality of p is upper bounded by 

NL[p\ < 4F(p) . 

Proof. We will use the same notation as in the proof of Proposition [3l As F{p) is smaller or equal 
than 1/2, it can be shown p] that p must be local and therefore the fully entangled fraction can 
be written as [2] 

F{p) = Ul+ max(|r,i| + \Tj,\ - \Tkk\)] , i,j,k e {1,2,3} . (C-16) 

Without loss of generality we can assume that Tn and r22 have a larger absolute value than T33 
and we can therefore write for the fully entangled fraction of p the simpler expression 

F{p) = ^ (1 + |Tn| + 1^22! - |r33|) . (C-17) 
Furthermore, as F{p) < 1/2 implies [2] that \Tii\ + IT22I + IT33I < 1, we get 

|Tn| + |r22| <2F(/5) . (C-18) 
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Maximal CHSH Violation for given Fully Entangled Fraction 




Fully Entangled Fraction F 



Figure 3: The maximal CHSH violation for given fully entangled fraction is plotted for pure states 
and weakly entangled mixed states Oq, (see Definition [2|) . All possible quantum states are located 
below the curve corresponding to pure states and Qa states. 



As we have assumed that Tn and T22 have a larger absolute value than T^^, we get for the non- 
locality of p (see proof of Proposition [3l) a value of NL[p] = ly^T^^+T^. Hence, we can upper 
bound the non-locality of p by 



NL[p] = 2^Tl, + < 2(|Tn| + IT22I) < 4F(p) , (C-19) 
which finishes the proof. □ 

Lemma 5. Let be a weakly entangled mixed state and a^^ be arbitrary two-qubit quantum states 
and J2ij Pij — 1 ^^^^ Pij ^ 0; then 

Fi^a) > Y,Pi3 ■ P^^'') ^ NL[VLc,] + c(a) > Y^Pij ■ NL[a'^] , 

ij ij 

where c{a) = a(2\/2 - 2) - 2Vl + + 2. 

Proof. We can rewrite the statement by using Propositions[T]and[2]as follows: if = ^kPk'F{(7^) 
then 2^/l + a'^ + c{a) > "^^kPk ■ NL[a^] with "^^Pk = 1 ^-nd pk > 0. Without loss of generality, we 
assume that for the first m terms in the sum, with m G {0, 1, ra}, it holds that 1/4 < F{a^) < 1/2, 
for k G {1,2, ...,m}, and for the remaining terms 1/2 < F{a^) < 1, for k G {m + l,...,n}. 

Let us make the following substitutions. If 1/4 < F(a^) < 1/2 then set Xk = 2F{(t^) - 1 G 
[-1/2,0) and if 1/2 < F{a^) < 1 then set yk = 2F{a'') - 1 G [0, 1]. In order to prove the lemma 
we need to find the minimal value c{a) must have such that the statement is true. By using 
Propositions [3] and U] we can write down the following constrained optimization problem 



max c(a) = -2x/l + a2 + Vp^ • 2 • (1 + x^) + V Pk-2-Jl + yl 



n 



subject to: a = '^Pk-Xk+ ^ Pk ■ Vk , Xk £ [-1/2,0) , G [0, 1] , 

k=l k=m+l 
n 

1 = J^Pfc , Pk>0. (C-20) 

k=l 
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One can show that this program is maximized for pk = for all k G {1, 2, m}. Furthermore, only 
two terms are needed in the second sum, i.e., Pm+i = P and Pm+2 = 1 — p- In order to maximize 
c(a) we have to set ym+i = 1 and ym+2 = 0. Putting these values in the first constraint gives us 
for p the following value 

p = a . (C-21) 
Inserting all these values into the objective function of the optimization problem results in 

c(q) = a{2V2 - 2) - 2 Vl + + 2 , (C-22) 

which proves the lemma. □ 



D Proof of Lemma [6] 

The proof of Lemma [6] is taken from [15j . 

Lemma 6. Let Po, Pi, Qq, Qi he four projectors acting on a Hilbert-space 7i such that Pq + Pi = I 
and Qo + Qi = I. There exists an orthonormal basis in 7i where the four projectors Pq, Pi,Qq, Qi 
are simultaneously block diagonal, in blocks of size 1x1 or 2 x 2. 

Proof. As the three operators Qo, {QqPoQo) and {QqPiQo) do all commute with each other, they 
can be simultaneously diagonalized [13]. Let \v) be one of their simultaneous eigenvectors. As 
Qo and Qi project onto complementary orthogonal subspaces and Qo\v) = \v), we conclude that 
Qil^;) = 0. Furthermore, as Pq + Pi = /, it cannot be the case that Pq\v) = Pi\v) = 0. 

If Po|f) = then Pi\v) = \v) = Qo\v) and the span of \v) (denoted Ey) corresponds to a 1 x 1 
diagonal block in which Pq, Pi,Qq,Qi have eigenvalues 0,1,1,0, respectively. The case Pi\v) = 
is similar. 

Consider the case where Po\v) 7^ and Pi\v) 7^ 0. Define the orthogonal vectors \ai) = Po\v), 
I02) = Pi\v), and the two-dimensional subspace = {ai\ai) + a2|a2) : Vai,a2 G C}. The fact 
\v) = \ai) + {02) implies \v) £ Ey. We have 

Qo\ai) = QoPo\v) = QoPoQo\v) = Xi\v) , 
Qo\a2) = QoPi\v) = QoPiQo\v) = h\v) 

because \v) is an eigenvector of QqPoQo and QqPiQo- The vector \w) = (1/Ai)|ai) — (1/A2)|a2) is 
therefore an eigenvector of Qo and Qi, i.e., Qq\w) = and Qi\w) = \w). Summarizing, the vectors 
I02) G Ey are simultaneous eigenvectors of Pq, Pi and the vectors \v),\w) £ Ey are simultaneous 
eigenvectors of Qq, Qi. Therefore, the subspace Ey corresponds to a 2 x 2 simultaneous diagonal 
block for Pq, Pi, Qo, Qi- The same can be done with the rest of simultaneous eigenvectors \v) as 
defined above. And analogously, for the simultaneous eigenvectors of Qi, (QiPqQi) and (QiPiQi) 
which are orthogonal to the vectors \w) that have appeared in the previous steps. At the end, 
the direct sum of the subspaces Ei,E2, ... is TC; each subspace Ei of dimension two contains two 
eigenvectors of each operator Pq, Pi, Qoi Qi- D 
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